MRC  Technical  Summary  Report  #2796 


STUDIES  IN  QUALITY  1MPK0VEMKNT  X: 

i)I:;i>i;ic;ion  effects  prom  i'ract ionai. 
pi-;:;  ions 

(Sooryo  E.P.  Box  and  K.  Daniel  Meyer 


Mathematics  Research  Center 
University  of  Wisconsin— Madison 
610  Walnut  Street 
Madison,  Wisconsin  53705 


February  19 ri 5 


(Received  February  26 ,  1985) 


OTIC  FILE  COPY 


Sponsored  by 

U.  S.  Army  Research  Office 
P.  O.  Box  12211 
Research  Triangle  Park 
North  Carolina  27709 


Approved  for  public  release 
Distribution  unlimited 


DTIC 

ELECTS 

MAy  9  1985 


85  4  10  12 


UNIVERSITY  OF  WSICONSIN-MADISON 
MATHEMATICS  RESEARCH  CENTER 


STUDIES  IN  QUALITY  IMPROVEMENT  I: 

DISPERSION  EFFECTS  FROM  FRACTIONAL  DESIGNS 

George  E.P.  Box  and  R.  Daniel  Meyer* 

Technical  Summary  Report  #2796 
February  1985 

ABSTRACT 

Fractional  factorial  designs  have  been  used  successfully  in  industry  and 
elsewhere  to  detect  and  estimate  sparse  factor  effects.  The  effects  usually 
envisioned  measure  changes  in  location  associated  with  the  experimental 
factors.  In  this  paper  we  consider  the  possibility  of  detecting  and  estimating 
sparse  dispersion  effects  measuring  changes  in  variance  associated  with  the 
factors. 
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SIGNIFICANCE  AND  EXPLANATION 


Unreplicated  fractional  factorial  designs  are  frequently  employed  as 
screening  designs  in  industrial  experimentation.  Current  methods  of  analysis 
are  primarily  concerned  with  determining  the  effect  of  experimental  variables 
on  the  mean  value  or  location  of  the  response  y  in  terms  of  the  usual  main 
effects  and  interactions  which  we  will  here  call  location  effects.  We 
consider  in  this  paper  the  possibility  that  the  variables  may  also  affect  the 
variance  of  y  analogously  through  dispersion  effects.  The  nature  of  the 
alias  relationships  between  location  and  dispersion  effects  is  discussed  and  a 
method  developed  for  identification  of  dispersion  effects  when  location 
effects  are  also  present. 


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  authors  of  this  report. 


STUDIES  IN  QUALITY  IMPROVEMENT  I: 
DISPERSION  EFFECTS  FROM  FRACTIONAL  DESIGNS 


George  E.P.  Box  and  R.  Daniel  Meyer* 

1 .  INTRODUCTION 

Table  1  shows  In  summary  a  highly  fractionated  two-level  factorial  design  employed  as 
a  screening  design  In  an  off-line  welding  experiment  performed  by  the  National  Railway 
Corporation  of  Japan  (Taguchi  and  Wu,  1980).  To  the  right  of  the  table  is  shown  the 
observed  tensile  strength  of  the  weld,  one  of  several  quality  characteristics  measured. 

The  authors  assumed  that,  in  addition  to  main  effects  only  the  interactions  AC,  AG, 
AH,  and  GH  might  be  present.  On  that  supposition,  all  nine  main  effects  and  the  four 
selected  two-factor  interactions  can  be  separately  estimated  by  appropriate  orthogonal 
contrasts  and  the  two  remaining  contrasts  corresponding  to  the  columns  labelled  e1  and 
ej  measure  only  experimental  error.  In  the  last  row  of  the  table  are  shown  the  grand 
average  and  the  fifteen  effect  contrasts  calculated  in  the  usual  manner.  In  this  paper 
these  will  be  referred  to  as  "location"  effects.  They  are  plotted  in  a  dot  diagram  below 
the  table.  A  normal  probability  plot  (Daniel  1959,  1976)  shows  thirteen  effects  roughly 
following  a  straight  line  with  main  effects  B  and  C,  falling  markedly  off  the  line. 
This  suggests  that,  over  the  ranges  studied,  only  factors  B  and  C  affect  tensile 
location  by  amounts  not  readily  attributed  to  noise. 

On  the  assumption,  then,  that  B  and  c  are  the  only  important  location  effects,  the 

2 

sixteen  runs  could  be  regarded  as  four  replications  of  a  2  factorial  design  in  factors 
B  and  C  only.  However  when  the  results  are  plotted  in  Figure  1  so  as  to  reflect  this, 
inspection  suggests  the  existence  of  a  dramatic  effect  of  a  different  kind  apparently  not 
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A  fractional  two- level  design  used  in  a  welding  experiment  with  the  estimated 
location  effects  plotted  as  a  dot  diagram.  For  convenience  in  later  discussion 
the  design  layout  has  been  recast  so  that  the  columns  as  well  as  the  rows  follow 
standard  factorial  order.  Also  the  preheating  variable  has  been  labelled  J  so 
that,  as  is  customary,  the  letter  X  may  be  used  for  the  initial  column  of  plus 


previously  noticed.  When  factor  C  is  at  its  minus  level,  corresponding  to  the  use  of  an 
alternative  material,  the  spread  of  the  data  appears  much  larger  than  when  C  is  at  its 
plus  level.  Thus  in  addition  to  detecting  shifts  in  location  due  to  B  and  C,  the 
experiment  may  also  have  detected  what  we  will  call  a  dispersion  effect  due  to  C. 

This,  of  course,  is  not  the  only  possible  explanation  of  the  data.  If,  instead  of 
adopting  the  assumptions  of  the  authors,  it  has  been  supposed,  for  example,  that  all  two- 
factor  interactions  might  be  appreciable,  then,  because  of  the  identity  -BCD  = 
lx  14  x  15  *  i  in  the  defining  relation  of  the  design,  the  large  contrasts  associated 
with  columns  14  and  15  could  have  been  due  to  B  and  C  as  postulated  or 
alternatively  to  B  and  BD  or  to  C  and  CD.  The  data  might  also  be  accounted  for  by 
supposing  that  certain  of  the  tested  factors  other  than  B  and  C  affected  C  only  at 
its  minus  level.  Analysis  of  the  eight  runs  made  at  the  minus  level  of  C  do  not  support 
any  simple  explanation  of  this  kind.  However  screening  designs  should  normally  be 
employed  in  a  sequential  process  of  investigation  where  the  alternative  possibilities 
which  they  offer  may  be  resolved  in  subsequent  experimentation.  (See,  for  example,  the 
discussion  of  Tippett's  cotton  experiment  on  pages  88-90  of  Fisher  (1966)  also  Box,  Hunter 
and  Hunter  197B.)  In  an  ongoing  investigation  therefore  such  possibilities  ought  to  be 
considered  for  further  study.  We  shall  pursue  the  implications  of  the  simplest 
explanation  here  while  inviting  the  reader  to  bear  all  the  above  provisos  in  mind. 


2.  DISPERSION  EFFECTS 


Fractional  arrangement  and  other  orthogonal  arrays  introduced  by  Finney  (1975), 
Plackett  and  Burman  (1946)  and  Rao  (1947)  have  frequently  been  used  in  industry:  see  for 
example  Davies  (1954),  Daniel  (1976),  Box,  Hunter  &  Hunter  (1978).  In  particular  they  are 
employed  as  screening  designs  when  it  is  believed  that  the  large  effects  it  is  desired  to 
detect  come  from  only  a  small  number  of  the  tested  factors.  This  may  be  called  the 
hypothesis  of  effect  sparsity.  The  tensile  data  suggests  the  general  possibility  that  the 
use  of  unreplicated  fractional  designs  might  provide  an  economical  way  of  detecting  sparse 
dispersion  effects  as  well  as  sparse  location  effects.  This  idea  is  pursued  in  the 
remainder  of  this  paper.  The  procedures  we  discuss  are  for  the  identification  stage  of 
the  problem-solving  iteration  (see  for  example  Box  &  Jenkins,  1976)  suggesting  tentatively 
which  factors  might  have  location  and  which  dispersion  effects  .  Efficient  maximum 
likelihood  estimation  for  fitting  an  identified  model  is  briefly  discussed  at  the  end  of 
the  paper. 

Consider  again  the  design  of  Table  1.  There  are  16  runs  from  which  16  quantities — 
the  average  and  15  effect  contrasts — have  been  calculated.  Now  if  we  were  interested  in 

possible  dispersion  effects  we  could  also  calculate  15  variance  ratios.  For  example,  for 

2 

the  i'th  column  we  could  compute  the  sample  variance  s  (i-)  from  the  eight  observations 

2 

associated  with  a  minus  sign  and  compare  it  with  the  sample  variance  s  (i+)  from  the 

2 

eight  observations  associated  with  a  plus  sign,  to  provide  the  ratio  *  s  (i+)/ 

2 

s  ( i— )  .  If  this  is  done  for  the  fifteen  contrast  columns  of  welding  data  the  values  for 
lnF^  given  in  Figure  2(a)  are  obtained. 

It  will  be  recalled  that  in  the  earlier  analysis  a  large  dispersion  effect  was 
associated  with  factor  C  (column  15).  However  in  Figure  2(a)  the  dispersion  effect  for 
this  factor  is  not  especially  extreme,  instead  the  effect  for  factor  D  (column  1)  stands 
out  from  all  the  rest.  We  will  see  how  this  may  be  accounted  for  by  the  aliasing  of 
location  and  dispersion  effects  which  we  now  consider  in  a  preliminary  way. 


Since  the  sixteen  location  effects  are  obtained  by  non-singular  linear  transformation 


of  the  original  sixteen  data  values,  calculated  dispersion  effects  must  be  functions  of 
the  location  effects.  The  general  nature  of  the  location-dispersion  aliasing  is  explained 
in  the  section  3  which  follows.  It  is  shown  that  each  dispersion  effect  is  a  ratio  of  /CXJJ.  >vuj_ 
‘CL sums  and  differences  of  the  location  effects.  For  immediate  illustration  equation  (1) 

shows  the  identity  that  exists  for  the  F  ratio  associated  with  factor  D,  and  hence  for 
column  1,  of  the  design.  In  this  expression  i  is  used  to  indicate  the  location  contrast 
associated  with  the  ith  column. 

-.2  ..2  ..2  »  .  2  .«2  »  «  2  ..2 
_  (2+3)  +  (4+5)  +  (6+7)  +  (8+9)  +  (10-MI)  +  (12  +  13)  +  (14+15) 

D  1  ..2  -.2  ..2  *«2  .»2  «*2  .,2  1  ’  ’ 

(2-3)  +  (4-5)  +  (6-7)  +  (8-9)  +  (10-11)  +  (12-13)  +  (14-15) 

This  equation  shows  in  particular  how  the  extreme  value  for  FD,  can  be  accounted  by 
the  location  effects  B  =  14  =  2.15  and  -C  =  15  =  ^.10  whose  squared  sum  and  squared 
difference  appear  respectively  in  its  numerator  and  denominator. 

A  natural  way  to  try  to  eliminate  such  aliasing  is  to  compute  variances  from  the 
residuals  obtained  after  least  squares  modelling  of  large  location  effects.  We  show  in 
Section  3  that  after  such  elimination  alias  relations  such  as  equation  1  remain  of  the 
same  form  but  with  location  effects  from  eliminated  variables  removed.  Dispersion  effects 
F^  calculated  from  residuals  after  eliminating  the  location  effects  of  B  and  C  are 
shown  in  Figure  2(b).  It  is  seen  that  an  extreme  dispersion  effect  is  now  associated 
with  C  agreeing  with  our  earlier  analysis. 
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3.  DISPERSION  AND  LOCATION  ALIASING 


3.1.  Identities  Existing  Between  Dispersion  and  Location  Effects 

In  order  to  study  the  identity  relations  existing  between  location  and  dispersion 
effects  consider  an  n  *  n  orthogonal  array  with  n  =  2^  columns  of  -1*s  and  +1*8 
labelled  x^ ,x ^ , . . . >Xn_ , •  Let  gg  =  l  be  a  column  of  +1's  and  the  remaining  columns 
delineate  the  usual  contrasts  for  the  main  effects  and  interactions  of  a  2^  factorial 
design.  In  general  we  suppose  that  the  array  is  to  be  used  either  as  a  fractional 

or  as  a  full  factorial  to  test  k  factors,  so  that  q  =  k  -  p  with  p  >  0. 

To  generate  the  columns  of  a  2^  orthogonal  array  it  is  convenient  to  begin  by 
writing  a  full  factorial  for  q  letters  employing  Yates'  standard  order  columnwise.  We 
then  label  columns  from  zero  to  n-1  (as  illustrated  below  and  in  Table  1  for  q  =*  3  and 
q  =  4  respectively).  In  practice  a  design  with  n  =  8  would  usually  be  too  small  to 
allow  variance  effects  to  be  usefully  studied.  We  employ  it  here  only  to  illustrate  the 
argument . 
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As  is  well 

known 

an  array  generated 

in  this 

way  may  be 

used  as 

a  full  factorial  or  as  a 

f  ract ional 

design 

.  For 

example , 

associating  three  factors  with  columns  1, 

2 ,  4  above 

reproduces 

the  2 

3  factorial,  four 

factors 

associated 

with  columns  1,  2, 

4,  7  produces  a 

r'4 

4-  1 

.7-4 

fractional 

,  seven  factors 

associated 

with  columns  1  through  7  produces  a 

f  ract ional 

The 

roman 

subscript 

is 

used  to 

denote  the 

design 

resolution: 

that  is  the 

j 

length  of 

the  shortest 

word  in  the 

defining 

notation  (see,  for 

example  Box 

&  Hunter 

V  .  j 

• 

( 1961 ' ) . 
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Now  because  the  columns 


*0'x1'  ■  ■  ‘  '*n-1 


form  a  group  closed  under  multiplication 


defined  such  that  the  product  column  x.  has  for  its  ut^1  lement  x.  =  x,,,x 

~ij  1  ju  i u  31 

any  such  product  column  must  be  a  column  of  the  original  array. 


elements 

of  a 

column 

V2  <X0  * 

Si” 

V2  (X0u  - 

r+i 

if 

X  . 

IU 

=  -1 

xiu' 

L  0 

if 

X  . 

IU 
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r° 

if 

X  . 

IU 

=  -1 

V2(X0u  + 

Kiu) 

»  < 

l+i 

if 

X. 

iu 

=  +1 

(2) 


Also  the  elements  of  a  column  V?  (x,  ±  x.  )  are 

^  ~ig 


Vi  ( x  -  x  )  »  Vt  x  ( x  -  x  ) 
*  ]u  igu  *  ju  Ou  iu 


V2  (X.  +  x,  )  -  V?X.  (x.  +  X.  ) 

*=  3  u  i3u  *■  3U  Ou  iu 


xju  if  xiu  "  _1 


0  if  x .  “  + 1 

iu 


’0  if  x.  -  -1 
iu 


x  if  x.  *  +1 
3U  iu 


(3) 


To  pee  how  this 

may 

be  used  to  study 

location  and 

dispersion 

aliasing 

consider 

illustration  the  2^ 

design. 
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to  compare  variances. 
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In  general  for  every 

i 

the 
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-  X 
~i*3 

)  will 

appear  in 

2  “ 

pairs 

identical  apart  from 

sign 

1.  Now  suppose  data  %  ; 

=  ty,- 

...,yu,..., 

V 

are 

available 
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3.2.  Elimination  of  Location  Effects 

The  sums  of  squares  in  (4),  (5)  would  be  appropriate  to  compute  dispersion  effects 
only  if  it  could  be  assumed  that  all  the  location  effects,  including  the  overall  mean, 
were  known  to  be  zero.  If  this  were  not  the  case  then  the  sums  of  squares  S(i-)  and 
S(i+)  could  be  inflated  by  location  effects.  To  remove  such  effects  we  can  replace  the 
yu's  in  (5)  by  residuals  yu  -  yu  obtained  after  eliminating  all  suspected  location 
effects,  including  the  mean,  by  least  squares.  Since  for  this  mode  of  estimation  the 
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vector  of  residuals  is  orthogonal  to  each  column  vector  corresponding  to  an  eliminated 


variable,  it  follows  that  the  identity  relation  for  a  sum  of  squares  calculated  from  such 
residuals  is  still  expressed  by  equation  (4)  but  with  all  estimated  contrasts 
corresponding  to  eliminated  variables  set  equal  to  zero. 

3.3.  Expected  Values  of  Sums  of  Squares  of  Residuals 

Further  understanding  is  gained  by  considering  the  expected  values  of  S(i-)  and 
S(i+)  under  various  circumstances.  Suppose  a  difference  in  variance  might  exist 
associated  with  the  level  of  the  single  column  j<^  and  the  sums  of  squares  S(i-)  and 
S(i+)  are  computed  from  (4)  but  with  yu  replaced  by  residuals  after  a  number  of 
location  effects  have  been  eliminated,  Wien,  after  setting  to  zero  all  the  elements  j 
and  i'j  in  (5)  which  correspond  to  eliminated  variables,  suppose  there  are  *  cases 
where  bracketed  pairs  (j,  i*j)  have  been  eliminated  and  m  cases  where  only  one  element 
of  a  bracketed  pair  has  been  eliminated  so  that  there  remains  y  -  t  -  m  complete 
bracketed  pairs. 

For  a  bracketed  pair 

{y  ( j  -  i’j)}2]  -  a2<i->  (6) 

and  for  a  single  element 

e£  {j  j}2]  »  j  (°2U->  +  ^ti+O  •  (7> 

It  follows  that 

E[s(i->]  »  <7  "  "  4  "  f  +  }  “02(i+)  (8) 

E[s(i+)J  -  (j  n  -  t  -  j-  m)o2(i+)  +  1  ma2(i-)  .  (9) 


If  we  now  define 

s2(i-)  =  S(i-)/(y  n-t-im)  ,  (10) 

then 

E[  s2  ( i- )  ]  -  d2(i-)  ♦  2n  .  ^  .  -2~  (a2(i+)  -  o2(i-)) 
and  similarly  for  s2(i+)  with  the  roles  of  o2(i-)  and  d2(i+)  reversed. 


(11) 


The  general  situation  may  be  better  understood  by  considering  a  few  special  cases 

again  using  for  illustration  the  8  *  8  factorial  array.  Setting  i  -  4  «  C,  suppose  we 

2  2 

wish  to  obtain  the  dispersion  effect  s  (4-)/s  (4+)  which  contrasts  the  variances  of  the 
first  four  and  last  four  observations. 

Elimination  of  Grand  Mean;  Elimination  of  the  mean  (which  would  usually  be  unknown) 
results  in  the  removal  of  0  in  equations  (4).  For  the  8  x  8  array j  n  »  8,  t  «  0, 
m  =  1 

32(4-)  =  (-^  (  (4 )  2  +  (1  -  5 )  2  +  (2  -  6 )  2  +  (3  -  7 ) 2 ) }  /<7/2) 
and  using  (11) 

e[s2(4-)J  =  02( 4- )  +  [o2<4+)  -  02(4- ) ] 

It  will  be  seen  that  the  slight  bias  in  the  variance  estimate  arises  because  the  isolated 
effect  4  is  a  function  of  all  eight  observations. 

Elimination  of  the  Mean  and  of  Effect  4  ;  If  the  location  effect  associated  with 
factor  4  is  eliminated  as  well  as  the  overall  mean  then  a  complete  pair  is  removed  in 
(5)  and  in  this  example 

s2(4->  =  { < 1  -  5 ) 2  +  (2  -  6 ) 2  +  (3  -  7 ) 2j  /3 

E[ s2 (4- ) J  =  O2 (4- ) 

No  bias  now  occurs  because  elimination  of  0  and  4  is  equivalent  to  eliminating  means 

2 

separately  from  the  first  four  and  the  last  four  observations,  and  s  (4-)  becomes  a 
function  of  only  the  first  four  observations.  Similar  effects  are  found  with  all 
bracketed  pairs.  Thus  if  we  eliminate  factor  2  and  the  interaction  2*4  =  6  the  bias 
term  does  not  appear  because  allowance  is  being  made  for  different  effects  of  factor  2  at 
the  two  levels  of  factor  4. 

In  the  circumstances  of  effect  sparsity  here  considered,  the  bias  term  in  (11) 

2  ,  2 

involving  o  ( i+ )  -  a  ( i- )  would  usually  be  rather  small.  For  example,  suppose,  with  a 


design  having  n  *  16  runs,  that  4  «  2  and  iu  “  1 ,  then  the  bias  term  will  be 
(a2 (r+  )  -  a2 ( i- ) }/22.  It  seems  reasonable  to  conclude  that  for  purposes  of  model 
identification  the  elimination  of  location  effects  by  simply  taking  residuals  is  unlikely 
to  mislead. 

However,  if  desired,  appropriate  linear  combinations  of  equations  (8)  and  (9)  will 
2  *2 

yield  unbiased  estimates  4  ( 1 — )  and  4  (i+)  as  follows: 

s2(i-)  -  s2(i-)  ♦  2n-.4;.4m  (s2u-)  -  s2(i+>; 

42<i+)  »  s2(i+)  +  - — ■  "  .  (s2(i+)  -  a  2(  1-  )  ] 

2n-44-4m 

3.S.  Dispersion  Interactions 

Since  more  than  one  dispersion  effect  might  be  present  we  need  to  consider  the 
possibility  of  interaction.  If  the  effect  of  changing  from  the  minus  level  to  the  plus 
level  of  a  factor  i  is  to  multiply  the  variance  by  irrespective  of  whether  the  plus 

or  minus  level  of  factor  j  is  employed  we  shall  say  that  there  iB  no  dispersion 
interaction  between  i  and  j.  In  such  a  case  the  variances  for  the  various  factor 
combinations  are  as  follows 


q2(i+,j+)  -  ♦it^a2 

q2U-,j-)  -  o2 

EBSBD3BI 

i  + 

Equivalently  for  the  logged  variances  the  dispersion  effects  will  be  additive  and  in  this 

metric  dispersion  interactions  of  all  orders  may  be  defined  in  the  usual  way.  It  shall  be 

noted  that  when  there  is  no  dispersion  interaction  the  ratio  of  the  average  variance  at 

the  plus  and  minus  levels  for  factor  i  is 

2  2  *  ( 1  +  *  )o2 

d  (it, j-)  +  0  (it,3+)  V  V  _  . 


a2(i-.j-)  +  o2(i-,j+)  <1  ♦  ♦la2  1 

and  similarly  for  factor  j  and  Thus  even  when  there  is  more  than  one  dispersion 

effect  the  simple  analysis  described  above  could  still  be  of  value  as  a  preliminary  analyti¬ 
cal  device  for  indicating  which  factors  needed  further  study.  In  particular  if  two  factors 
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1  and  j  appeared  to  exhibit  dispersion  effects,  then  further  analysis  would  be 
appropriate  to  consider  the  general  evidence  for  activity  of  these  effects  taking  account 
also  of  possible  interaction.  This  could  be  done  by  considering  general  differences  among 
the  sums  of  squares  associated  with  the  four  cells  Sti-,j-),  S(i-,j+),  S(i+,j-),  S(i+,j+) 
of  the  two-way  table  for  the  two  factors.  As  before  these  sums  of  squares  would  be 
calculated  from  residuals  after  eliminating  location  effects.  The  consequences  of  doing 

this  is  explored  in  the  Appendix  which  gives  a  matrix  generalization  of  earlier  results. 

2  2 

A  convenient  function  for  comparing  a  set  of  variances  s1#...,s^  having 
degrees  of  freedom  respectively  is  Bartlett's  criterion, 

_1  k  2  k  2  k 

M  -  N  tn(N  v  s  )  -  )_  v  In  s  ,  where  N  «  \  V  . 

t-1  1  1  t-1  *  t-1  Z 

When,  as  would  frequently  be  the  case,  the  screening  design  is  of  only  moderate  size  one 
could  not  expect  to  study  simultaneously  a  large  number  of  factors  in  this  way.  For 
example,  for  n  «  16,  the  individual  cells  from  which  S { i— , j— )  S(i-,j+),  etc.  would  be 
calculated  will  each  contain  only  four  observations.  However  when,  in  circumstances  of 
effect  sparsity,  only  a  very  few  such  effects  are  likely  to  be  of  appreciable  magnitude, 
the  above  analysis  could  be  of  value. 

We  again  illustrate  with  the  welding  data.  Figure  3(a)  shows  the  35  distinct  values 
of  M  computed  for  the  data.  There  are  (j5)  “  '05  ways  of  choosing  two  columns  from 

the  15  columns  of  the  design  but  these  are  aliased  in  sets  of  three  (any  column  is  the 

product  of  two  other  columns).  Thus  the  largest  value  is  associated  with  columns  15  *  C, 

2  =  H,  and  13  =  J.  This  effect  could  equally  well  be  attributed  to  factors  C  and  H 

with  interaction  -CH  =  J  or  to  C  and  J  with  interaction  -CJ  =  H  or  to  H  and  J 

with  interaction  -HJ  »  C.  It  is  noteworthy  however  that  the  seven  largest  values  of  M 

which  stand  out  from  the  rest  all  include  factor  C  in  their  triplets.  Also  if  the 
dispersion  effect  of  C  is  eliminated  by  rescaling  the  residuals  the  plot  (Fig.  3(b))  no 
longer  shows  outstanding  points. 
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4.  MAXIMUM  LIKELIHOOD  ESTIMATES  OF  LOCATION  AND  DISPERSION  EFFECTS 


Once  a  model  has  been  Identified  a  more  precise  fitting  is  possible  using  maximum 
likelihood.  Hartley  and  Jayatillake  (1973)  have  shown  that  the  following  method  will  give 
convergence  to  a  stationary  point  of  the  likelihood.  Conditional  on  the  dispersion 
effects,  location  effects  may  be  obtained  by  weighted  least  squares;  the  dispersion 
effects  may  now  be  recomputed  from  the  residuals  and  the  iteration  continued  until 
covergence  is  achieved.  It  is  often  convenient  to  assume  initially  that  there  are  no 
dispersion  effects. 

For  illustration  the  following  table  shows  maximum  likelihood  estimates  for  the 
welding  data  assuming  location  effects  for  B  and  C  and  a  dispersion  effect  for  C. 

The  earlier  approximate  estimates  are  indicated  for  comparison. 

o  2(C-) 

A  A  A  q  A  A  A  A 

U  B  -Co  ( C- )  o  (C+)  a  (C+) 

Maximum  likelihood  estimates  42.96  2.04  3.10  .469  .021  22.3 

Earlier  approximate  estimates  43.00  2.15  3.10  .564  .031  18.2 


Table  2.  Estimates  of  location  and  dispersion  affects:  welding  data. 


Appendix 

The  results  of  section  3  can  be  generalized  to  two  variables  using  matrix  algebra. 

Again,  X  is  a  matrix  of  ±1's  with  orthogonal  columns  , . . . >Xn_ 1 >  X  is  the  vector 

of  observations,  and  j  =  x.y.  Define  I(*)  to  be  a  n  x  n  diagonal  matrix  with  I's  in 

~3 

those  rows  for  which  the  condition  *  is  true,  0's  elsewhere.  Then,  for  example, 

I(i-)  is  diagonal  with  I's  in  rows  where  =  -1;  I(i-,j-)  is  diagonal  with  I's  in  rows 
where  x^  =  x^  =  -1;  I  is  the  identity  matrix. 

The  following  four  identities, 


S(i±)  =  (lUDxJ'UUt)*) 


V*  "  U  -  2i(i-)Jx 


l  -  £<  i->  +  i<i+> 


imply,  after  some  algebra,  the  previously  shovm  identity. 


«  n  i  *  *  2 

s(i-)  >‘3  "  i,j)  ’ 
j=0 

Extending  to  two  variables,  the  additional  identity 
S(i±,j±)  -  ( I(  i± ,  '  ( I(  i±,  3±>x) 


implies 


«  n™  1a  a  ^  a  « 

S(i-,j-)  «  — -  l  (k  -  i*k  -  j  *k  +  i«D*k) 

1  Oil  «  « 

k*0 


4  I  *  A  A  A  , 

S(i-,j+)  »  -TT—  \  (k  -  i*k  +  j*k  -  i*  j  *k) ‘ 


S(i+,j-)  l  (k  +  i-k  -  j-k  -  i*j»k)2 

k-0 

S(i+,3+)  -  l  (k  +  i*k  +  j«k  +  i*j*k)2  . 

k-Q 

Now  to  compute  expectations  of  these  quantities,  suppose  that  Z  is  the  n  x  p 
matrix  of  columns  of  JC  which  correspond  to  location  effects  included  in  the  model,  i.e. 
eliminated  to  obtain  residuals.  Then,  assuming  E[^]  =  Zr,  and  using  the  identity 


Elx’&Xl  *  trace  (&E[juc' l) 


for  A  symmetric. 


E[S(i-,3-)l 


n  “  2p  2 


(i-,j-)  +  — -  \  trace  [zz '!( i-, j- )ZZ ’ I ( i±, j± ) j a2 ( i±, j  + ) 


where  summation  is  over  the  four  possible  combinations  of  i±,j±.  To  compute  the  trace  in 
the  above  expression,  we  divide  the  columns  of  g  into  four  groups: 


Group  4: 


Those  columns 


such  that 


and 


are  also  in  Z. 


Group  3 : 


Group  2 : 


Group  1 : 


such  that  2S j  -£k  and  2Si  *2S  j -v5k 

Those  columns  z  such  that  exactly  two  of  x.  ,  x  •  z.  ,  x  «x  •  z 
are  also  In  J.. 

Those  columns  z,  such  that  only  one  of  x.»z,  .  x.«z,  ,  x.«x.*z, 

~k  ~i  ~k  ~ j  ~k  ~i  ~j  ~k 

is  also  in  g  . 

Those  columns  not  in  previous  three  groups. 


Let  mk  «  number  of  columns  in  group  kj  note  that  mk  is  a  multiple  of  k.  Further 
subdivide  group  2  into  three  subsets 


Group 

2.1: 

those  pairs 

Z  ,  Sir 
g  k 

with 

Sg 

•  Zi'Sk 

Group 

2.2: 

those  pairs 

~g  *  ~k 

with 

Sg 

-  v*k 

Group 

2.12: 

those  pairs 

V  *k 

with 

Sg 

-  Xi-Zj-tK 

Let  k  be  the  number  of  columns  in  group  2.k;  ntj  “  +  ^2,2  +  m2  12* 

Then  after  some  algebra. 


_  4n  -  7p  +  3m.  +  2m,  +  dl 

e[  S( i- , j- )  J  -  o2,i-,j-)[ - ^ - !j 


,  p  -  Hi  •  r  m,  -  m.  +  2m_  , 

♦  o2(i-,j+)[ - *■  3Y63  2 - 


2..  ,  ,,P  "  "4  "  3  “3  _  m2  +  2 m2 . 2 1 

+  o  (l+,J-)l - ^ - J 


+  o2u+,j>)[ — 4  3  ;6-  2 — 


Note  that  S(i— ,j— )  will  be  unbiased  (up  to  a  scale  factor)  if  all  columns  of  Z  are  in 

group  4  (p  *  m^,  m3  “  i»2  “  m1  “  0)  i.e.  for  each  variable  x^  eliminated,  variables 

x  x.  ,  x  are  also  eliminated.  Similar  expressions  for  expectations  of 

Kf  *  K  "*1  *  J  *  K 

S(i-,j+),  S(i+,j-)  and  S(i+,j+)  can  be  worked  out  quite  easily  from  the  above  formula 


by  switching  signs  on  i±,j±.  In  particular,  the  expression  for  E(S(i-)j  derived  in 
section  3.3  follows  immediately  with 


o2(i-)  =  1  lo2(i-,j-)  +  o2(i-,j+>) 
o2(i+)  =  1  (o2  (  i+,  j- )  +  o2(i+,j+))  . 

Thus  the  calculation  of  estimated  dispersion  effects  defined  as  the  change  in  the  average 
variance  is  not  affected  by  the  existence  of  more  than  one  real  dispersion  effect. 
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